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(57) A gene of a protein having an enzymatic activ- 
ity enabling the control or regulation of the contents of 
saturated and unsaturated fatty acids in plant cells and 
the enzyme protein as the expression product of this 
gene, more specifically: proteins showing a p-ketoacyl- 
ACP-synthetase II (KAS II) activity and having a specific 
amino acid sequence typified by the amino acid 



sequence of KAS II originating in blue-green algae 
(Anacystis nidulans ) or an amino acid sequence sub- 
stantially equivalent thereto; a KAS II protein gene 
encoding the amino acid sequence of the above protein; 
and recombinant vectors containing this gene and cells 
having this gene transferred thereinto. 
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Description 

Technical Field 

5 [0001 ] The present invention relates to the amino acid sequence of a synthetase responsible for the synthesis of fatty 
acids in plants and the structure of DNA related to the same, that is to say, a fatty acid synthesizing enzyme protein hav- 
ing a specific amino acid sequence, and a gene coding for it. Cells can be transformed with such a gene and chimera 
genes in which an appropriate regulatory sequence (regulatory gene) has been inserted to control the amounts of sat- 
urated and unsaturated fatty acids in the cell. 

10 

Background Art 

[0002] Fatty acid synthases are known to be divided into two types; the enzymes in animals and yeasts are fatty acid 
synthetase complexes (FAS) in which a variety of enzymes are wholly linked as a complex having a single function (type 

75 I), while those in higher plant cells and procaryotes are of such type that each of the enzymes become independently 
disconnected outside the organisms (type II). An acyl carrier protein (ACP) which is a soluble protein is required for the 
synthesis of a fatty acid with the enzyme type II, and the fatty acids are synthesized as an acyl-ACP. The final product 
of the synthesis system is palm itoyl- ACP. The palmitoyt-ACP is further converted into stearoyl-ACP by chain elongation 
before desaturation with a soluble fatty acid desaturase (stearoyl-ACP desaturase) to lead to oleoyl-ACP. Palmitic acid 

20 and oleic acid are incorporated into polar lipid, and then the latter is further desaturated (J. Ohlrogge and J. Browse 
(1995) Lipid Biosynthesis. The Plant Cell, 7, p. 957-970). 

[0003] The chain elongation enzyme which catalyzes the chain elongation from palmitoyl-ACP to stearoyl-ACP pro- 
duces stearoyl-ACP from palmitoyl-ACP. malonyi-ACP and NADPH (J. Ohlrogge and J. Browse (1995) Lipid Biosynthe- 
sis. The Plant Cell, 7, p. 957-970). These reactions describe the total scheme of a series of enzyme reactions for the 
25 production of a stearoyl-thioester by the reduction, dehydration and further reduction of the condensation product of a 
palmitoyl-thioester and the C2 unit. 

[0004] Lipid biosynthesis in plants has been studied very extensively (Browse et al.. Annu. Rev. Plant Physiol. Mol. 
Biol. (1 991 ) 42: 467-506). It has been elucidated from these researches that the production of stearic acid starting from 
palmitic acid in plant cells is a reaction catalyzed by the enzyme p-ketoacyl-ACP synthetase II (KASII). However, there 
30 has been described in the aforementioned publication the isolation of neither the enzyme^KASII or its gene, and thus 
their sequences remain unknown. 

[0005] Three isozymes of KAS have been found in chloroplasl in plants. Among the two isozymes other than KASII, 
the isozyme KASUI catalyzes the initiation of the synthesis of an acyl chain, while the isozyme KASI catalyzes the elon- 
gation reaction of an acyl chain to the palmitoyl-ACP with 16 carbon atoms. A variety of mutants of enzymes involved 

35 in the lipid synthesis of plants have been isolated from Arabidopsis, among which the enzymes responsible for the 
desaturation reaction have been studied extensively. There have also been described tor the KASII, the mutant of which 
has been designated as fabl . In this mutant, the KASII enzyme activity was lowered to 65%, and thus the palmitic acid 
content increased by 7% in leaves and 3% in roots (Wu et al., Plant Physiol. (1994) 106: 143-150). As regards the com- 
plete purification of the enzyme KASII, genes have been cloned from a castor bean (Ricinus communis; Japanese Pat- 

40 ent Laid-Open Publication No. 500234/1994) and soybean (Glycine max; Japanese Patent Laid-Open Publication No. 
501 446/1 995) on the basis of the amino acid sequences of the limitedly degraded peptide of the purified enzyme. In the 
above described publications, as regards the changes of fatty acids by transformation with these genes, the Ci 6 fatty 
acid content on the expression of the gene of the castor bean in E. £0|j was decreased by ca. 20% thus corresponding 
to a little over 30% of the total fatty acid content, while on introducing the soybean gene into canola the palmitic acid 

45 content in the seeds was decreased by 0.8% and on introducing the gene into tobacco the palmitic acid content in the 
leaves was decreased by ca. 2%. In this connection, no sequence exhibiting distinct homology has been curiously 
found between the genes of the castor bean and soybean. 

[0006] By the way, it has been known that in membrane lipids constituting biomembrane, the phase transition temper- 
ature varies primarily depending on the unsaturation degrees of the fatty acid linked to the lipid, and as a result the chill- 
so ing resistance of the organism also varies. It is thought that the unsaturation degree of the membrane lipid is effectively 
increased with an enzyme such as fatty actd acyltransferase (PCT/JP 92/00024 (PCT/WO 92/13082)), fatty acid desat- 
urase (PCT/JP 94/02288 (PCT/WO 95/18222)). 

Disclosure of the Invention 

55 

[0007] In consideration of the above described situations, the object of the present invention is to provide a gene of a 
protein having an enzyme activity which makes it possible to regulate or control the content of saturated fatty acids and 
unsaturated fatty acids in plant cells or microorganism cells and an enzyme protein as the expression product. 
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[0008] It is believed that if there is a protein having such an enzyme activity that the decrease of the enzyme activity 
responsible for the synthesis of fatty acids leads to the increase of the palmitic acid content in lipids of cells while the 
increase of enzyme activity leads to the increase of content of the fatty acids with 18 or more carbon atoms, the unsatu- 
rated fatty acid contents in the lipids is possibly increased for example as a result of the increase of content of the fatty 

5 acids with 18 or more carbon atoms due to the increase of the enzyme activity 

[0009] The present inventors have conducted earnest researches in order to solve the above described problems, 
and as a result, successfully isolated a gene which codes for an enzyme p-ketoacyl-ACP synthetase II (KASII) from 
cyanobacterium (Anacvstis nidulans) . and found that the introduction of the gene into E coji confers the KASII produc- 
ing ability whereby fatty acids having extended in chain length increase. The present invention has been accomplished 

io on the basis of the finding. 

[001 0] That is, the present invention relates to the protein which has an amino acid sequence represented by SEQ 
ID NO. 2 or substantially the same amino acid sequence as the one represented by SEQ ID NO. 2 and exhibit the KASII 
enzyme activity. 

[001 1 ] The present invention also relates to the KASII enzyme gene coding for the protein which has an amino acid 
75 sequence represented by SEQ ID NO. 2 or substantially the same amino acid sequence as the one represented by 
SEQ ID NO. 2 and exhibit the KASII enzyme activity 

[001 2] Furthermore, the present invention relates to the recombinant vector containing the gene and the cells in which 
the gene has been introduced. 

20 Brief Description of the Drawings 

[0013] 

Fig. 1 shows the amino acid sequence represented by single-letter codes corresponding to SEQ ID NO. 2 (three- 
25 letter codes amino acid). 

Fig. 2 illustrates the comparative chart of the amino acid sequence of barley (Accession No. P23902 in SWISS- 
PROT data base) KASI enzyme with the amino acid sequence expected from the DNA sequence of the castor bean 
(GENBANK Accession No. L13241) KASII enzyme. 

Fig. 3 illustrates the comparative chart of the amino acid sequence of the enzyme KASII (referred to as KASII in the 
30 drawing) derived from Anacvstis nidulans in the present invention with the amino acid sequence (referred to as Fab) 
of fabF or J (code Si 1 1069) of Synechocystis sp. Strain PCC6803. 

Best Mode for carrying out the Invention 

35 [0014] The present invention is now explained in detail as follows. 

KASII enzyme active protein and its gene 

[001 5] As described above, the protein of the present invention having the KASII enzyme activity has the amino acid 
40 sequence represented by SEQ ID NO. 2 (corresponding to the amino acid sequence (single-letter code) in Fig. 1) or 
substantially the same amino acid sequence as the one represented by SEQ ID NO. 2, and the gene of the KASII 
enzyme active protein according to the present invention codes for the above described protein having the amino acid 
sequence represented by SEQ ID NO. 2 or substantially the same amino acid sequence as the one represented by 
SEQ ID NO. 2. The phraseology "protein having the KASU enzyme activity" or "KASII enzyme active protein" in the 
45 present invention means the protein having an enzyme activity which produces a longer fatty acid (particularly stearic 
acid) by extending the chain length of a fatty acid (particularly palmitic acid). 

[0016] As the KASII enzyme active proteins described above, there can be used in the present invention an appro- 
priate naturally occurring gene products as well as mutant gene products in which a part of the amino acid sequences 
of the proteins has been mutated, provided that the proteins have the above described KASII enzyme activity By way 
so of example, the product of the KASII enzyme active protein gene typically includes the KASII enzyme derived from the 
cyanobacterium as a microorganism (SEQ ID NO. 2). 

[0017] The phrase "substantially the same amino acid sequence" in the present invention means that the sequence 
of the mutant described above is also included, such as typically the amino acid sequence of the enzyme protein 
derived from the cyanobacterium represented by SEQ ID NO. 2 (SEQ ID NO. 2, Fig. 1) or the sequence in which one 
55 or more, preferably one or a few amino acids have been substituted, deleted, inserted or added. 

[0018] Therefore, the term "substantially" in the case of "the gene coding for ... substantially the same amino acid 
sequence" in the present invention is intended to include not only the gene of a DNA sequence coding for the naturally 
occurring protein having the KASII enzyme activity defined above, but also the gene of a DNA sequence coding for the 
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mutant KASII enzyme active protein described above, typically the gene of the DN A sequence coding for the amino acid 
sequence represented by SEQ ID NO. 2 or the amino acid sequence in which one or more, preferably one or a few 
amino acids are substituted, deleted, inserted or added. Also, it goes without saying that when a DNA chain generally 
codes for a polypeptide having an amino acid sequence, plural gene codes (codons) corresponding to an amino acid 
5 are present (degenerated mutants), and thus any gene codes can be used also in the DNA chain coding for the KASII 
enzyme active protein of the present invention. 

[001 9] The KASII enzyme active protein encoded by the gene of the present invention has a function of the chain elon- 
gation enzyme for the fatty acid synthesis which is originally present in plants and microorganisms as described above, 
and thus, in the further specific function, has an enzyme activity for producing a longer tatty acid (particularly, stearic 

10 add) that the chain length of a fatty acid (particularly, palmitic acid) has been elongated. The typical example of the pro- 
tein according to the present invention is the one derived from cyanobacterium. The chemical structure of the enzyme 
KASII derived from the cyanobacterium is locally similar to the protein encoded by the KASI gene of E. coli and barley, 
and also similar to the enzyme derived from the castor bean among the patent publications regarding to the above 
described KASII. The KASII enzyme active protein according to the present invention has. as described above, the 

75 amino acid sequence represented by SEQ ID NO. 2 or substantially the same amino acid sequence as the above 
described one, and a markedly high enzyme activity for producing a longer fatty acid (particularly, stearic acid) in which 
the chain length of a fatty acid (particularly, palmitic acid) has been elongated. 

[0020] While a means for obtaining the gene coding for the protein of the present invention is the chemical synthesis 
of at least a part of the chain according to the method of nucleic acid synthesis, it is more preferable in consideration of 
20 the number of the amino acids to be linked to use the method, rather than the chemical synthesis, in which cDNA is 
synthesized from mRNA isolated from a naturally occurring material, in particular cyanobacterium as a bacterium, and 
the gene is obtained from the gene library by the method commonly used in the field of genetic engineering. 
[0021] The gene oi the enzyme KASII can be obtained for example as follows. 

[0022] Firstly, the enzyme from a higher plant or a microorganism, particularly cyanobacterium is purified by the 
■25 known method, and fragmented with peptidase to determine the amino acid sequences of the fragments. Oligonucle- 
otides corresponding to the fragmented peptides whose amino acid sequence has been determined are then synthe- 
sized. The total RNA is separately extracted from the plant or the microorganism, and the DNA complementary to the 
RNA (cDNA) is synthesized. The cDNA is linked to an appropriate vector such as phage Agtl 1 to make the cDNA 
library In this connection, as the method for screening the gene, the conventional methods, for example immunological 
30 methods such as the plaque hybridization method with antibody or the colony hybridization method, or the hybridization 
method with nucleotide probe or the like can be used. 

[0023] It is also possible to obtain the target sequence by designing primers corresponding to the short DNA 
sequences positioned at both of the ends of the aimed sequence on the basis of the consensus sequence of a known 
KASII enzyme or of the other isozyme related to it, and conducting PCR with DNA obtained from a material used for 
35 determining the total sequence as a template. In this case, the activity can be identified for example by expressing the 
KAS gene in E. coli in order to discriminating the isozyme of the gene. 

[0024] The DNA sequence of the gene according to the present invention in the clone thus screened can be deter- 
mined and confirmed generally by the known methods such as the dideoxynucleotide chain terminating method with 
Mi 3 phage (Sambrook et al., Molecular Cloning, 2nd edition (1989)). 

40 [0025] The present gene of which DNA sequence has been determined as described above can be also synthesized 
generally by the known means, for example a commercially available DNA synthesizer by the phosphite method. 
[0026] Also, for expressing a DNA chain or its fragment to produce a protein or a polypeptide encoded thereby, an 
expression regulatory sequence is required in addition to the DNA sequence (coding region) corresponding to the 
amino acid sequence. Thus, the DNA chain of the present invention includes the DNA sequence comprising such 

45 expression regulatory sequence. Among the expression regulatory region, an important one particularly for expressing 
it in a higher plant is the promoter sequence upstream of the coding region (e.g. derived from the 35S promoter of cau- 
liflower mosaic virus), and the poly A addition signal downstream (e.g. derived from the terminator of nopaline synthesis 
enzyme). When DNA obtained is the genome gene of a higher plant, it can also be used directly provided that the DNA 
sequence comprises expression regulatory region. 

50 

Use of KASII enzyme active protein gene 

[0027] As described above, the present invention relates also to the recombinant vector comprising the above 
described DNA chain or its fragment, and to cells into which the gene has been introduced. 
55 [0028] The recombinant vector is a vector to which the above described DNA chain or its fragment has been linked, 
and there can be used the known vector such as plasmid (e.g. pETl7b), phage (e.g. *2APII). 
[0029] The enzyme KASII can be produced in a host such as an appropriate plant or microorganism cells as 
described above by introducing the recombinant vector DNA into the host for expression. 
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[0030] In this connection, although the cells may be either microorganism cells or plant cells irrespective of the kind 
of organisms, the microorganism cells include E. coli and the like, and the plant cells include chilling sensitive plants 
such as tobacco and the like. The gene can be introduced into plants generally with the known methods such as the 
ones described in "Plant Molecular Biology Manual, Second Edition; S.G. Gelvin, and R.A. Schilperoort eds, Kluwer 
5 Academic Publishers, 1995*. By way of example, there can be mentioned the biological methods which include a 
method with virus or a method with Agrobecterium, and the physicochemical methods which include the electroporation 
method, the polyethylene glycol method, the particle gun method, and the like. 

[0031] Also, the enzyme KASIl is a protein which is present in chloroplast envelope in plants, so that it is necessary 
to attach the DNA chain coding for transit peptide to the chloroplast upstream the enzyme KASIL By way of example, 
w the small sub-unit gene of ribulose-1 ,5-bisphosphate carboxylase of pea can be used as a gene coding for the transit 
peptide. 

[0032] The KASIl enzyme active protein of the present invention, typically the gene coding for the KASIl enzyme 
active protein (amino acid SEQ ID NO. 2) derived from the cyanobacterium Anacvstis nidulans (the DNA sequence of 
the gene derived from the cyanobacterium is represented by SEQ ID NO. 1) is useful for the improvement of lipid com- 
75 position in plants and microorganisms by transformation, particularly the control of the amount ratio between fatty acids 
with 16 and 18 carbon atoms. 

[0033] The expression of the KASIl enzyme protein of the present invention as a foreign protein in an organism leads 
to the elongation of the chain length of fatty acids from 16 carbon atoms (palmitic acid) to the 18 carbon atoms (stearic 
acid), the stearic acid is desaturated in an organism, and the content of unsaturated fatty acids is increased. It is 
20 believed that chilling resistance is enhanced in plants in which unsaturated fatty acids have been increased (PCT/WO 
92/13082, PCT/WO 95/18222), and it is expected that the resistance to the stress of culture is enhanced in microorgan- 
isms (yeast, E^ cell and the like) in which unsaturated fatty acids have been increased. 

Examples 

25 

[0034] The present invention is described below in detail with reference to examples, it is not to be limited by these 
examples. 

Example 1 Preparation of DNA derived from A^ nidulans . and preparation of DNA library 

30 x 

[0035] A. nidulans (Catalog No. 1AM M-6; it is possible to obtain it from Institute of Molecular Cytology, Tokyo Univer- 
sity) was cultured in about 100 ml of the BG-1 1 culture medium prepared according to the method described in p. 279 
of Plant Molecular Biology, by Shaw (IRL PRESS, 1988). The bacterial cells were cultured sufficiently by shaking at 120 
times/min under a fluorescent light of 1000 lux at 25°C. The cells were recovered by centrifugation at 5,000 g for 10 min. 
35 at room temperature. 

[0036] In order to isolate DNA, the precipitated ceils were suspended in 50 ml of 50 mM Tris CI (pH 8.0), 1 mM EDTA 
(solution A) and washed by centrifugation again. The cells were then re-suspended again in 15 ml (solution) of 50 mM 
Tris CI (pH 8.0), 20 mM EDTA, 50 mM NaCl, 0.25 M sucrose (Solution B), to which 40 mg of lysozyme (Sigma) dissolved 
in Solution B was added, and the mixture was snaked slowly at 37°C. After 1 hour, 15 mg of proteinase K and SDS at 

40 a final concentration of 1% were added, and the mixture was shaked slowly over night at 37°C . Next day, NaCI0 4 was 
adjusted to a concentration of 1 M, 20 ml of chloroform/isoamyl alcohol (24 : 1) was added, the mixture was shaked 
slowly for 10 minutes, and the aqueous layer was separated by centrifugation. After extraction with chloroform/isoamyl 
alcohol was repeated once again, 50 ml of ethanol was added, and DNA was recovered by winding it around a glass 
rod. The DNA was dissolved in 20 ml of solution A, NaCl was adjusted to a concentration of 0.1 M, RNase at a concen- 

45 tration of 50 mg/ml was added and the reaction was conducted at 37°C for 1 hour. The reaction mixture was then sub- 
jected to an extraction twice with an equivalent amount of phenol saturated with solution A. After DNA in the aqueous 
layer was recovered by the addition of ethanol and washed with 70% ethanol, it was dissolved in 1 ml of solution A to 
prepare the DNA solution. 

[0037] After partial digestion of ca. 100 g of DNA with Sau 3A I for the purpose of preparing a genomic DNA library 
so from DNA thus obtained, DNA of about 9 - 23 kb was collected by ultracentrifugation on a gradient of sucrose density 
according to the method described by Sambrook et a!. It was cloned into DASH II (kit by Stratagene) cleaved with Bam 
HI and Hind III. 

Example 2 Cloning of KASIl enzyme-iike gene from cyanobacterium Anacystis nidulans 

£5 

[0038] Several short DNA chains were synthesized by comparing the enzyme KASI of barley with the enzyme KASIl 
of castor bean while paying attention to the regions having high homology between these enzymes (Fig. 2). Among 
these chains, distinct bands in accordance with expected sizes were observed in reactions carried out with the following 
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combination. 

1 : 5'-CC( ACGT)CC(AG)AA(ACGT)CC(AG)AA(ACGT)GA( AG)TT3' (SEQ ID NO. 3) 
2: 5 , -GA(AG)GA(AG)GT(ACGT)AA(CT)TA(CT)AT(ACT)AA(CT)GC-3' (SEQ ID NO. 4) 

5 

[0039] Among the sequences, SEQ ID<NO. 4 is a sense primer corresponding to the amino acid sequence 
EEVNYINA, and SEQ ID NO. 3 is a primer coding for the anti-sense chain corresponding to the amino acid sequence 
NSFGFGG. The PCR reactions were carried out with the sense and anti-sense primers. The reaction was performed 
under a condition of using a GeneAmp™ PCR kit (Takara Shuzo Co., Ltd.) by adding in 100 jil of reaction solution 20 

10 nM of the primers, respectively, and 1 ng of DNA derived from A,, nidulans . The reaction program of 35 cycles was per- 
formed with each cycle comprising the reaction at 95°C (1 minute), 50°C (1 minute) and 72°C (2 minutes), provided that 
only in the first cycle, the reaction at 95°C was extended to 3 minutes and the reaction temperature at 50°C was 
changed into 35°C. After completion of reaction, the reaction mixture was subjected to extraction with 100 jjlI of chlon> 
form to recovered the aqueous layer and chloroform was then removed with 100 \i\ of ether to give an aqueous layer, 

is from which 10 \i\ portion was analyzed by 2% agarose gel electrophoresis. 

[0040] As a result, DNA having the same size as the expected one (ca. 330 bp) was detected. The DN A fragment was 
cloned into a pCRII vector (Invitrogen Co.). The DNA sequence was determined by the dideoxy method with use of a 
fluorescent sequencer (manufactured by Applied Biosystem Co.). The DNA was labeled with 32P-dCTP using Mul- 
tiprime DNA labelling kit (Amersham Co.) to prepare a probe and used for the following experiment of hybridization. 

so [0041] EL co!i P2392 was infected with a phage in the DNA library and ca. 10.000 plaques were formed on a plate 
having a diameter of ca. 15 cm and containing an NZYM medium, which were transferred onto a nylon membrane. 
Hybridization was carried out by the method described by Sambrook et al. (Molecular Cloning; Second edition, Cold 
Spring Harbor Laboratory Press, 1989) under a condition in a solution consisting of 5 x SSC (1 x SSC comprising 0.15 
M NaCI and 15 mM sodium citrate), 10 mM EDTA, 10 x Denhardt solution (50 x Denhardt solution comprising Ficoll 

25 (Type 400, Pharmacia Co.), polyvinyl pyrrolidone, bovine serum albumin (fraction V, Sigma Co.), respectively, in an 
amount of 10 g/i), and 250 u.g/ml of salmon sperm DNA at 60°C for 16 hours. Thereafter, the membrane was washed 
twice with 5 x SSC and 0.1% SDS solution at 45 °C for 15 minutes and subjected to autoradiography. Ten positive 
clones were purified to obtain the phage DNAs, which were cleaved with some restriction enzymes, subjected to agar- 
ose gel electrophoresis followed by Southern blotting on a nylon membrane according to the conventional method. The 

30 membrane was hybridized under the same condition as the above described plaque hybridization to compare the 
hybridization strengths and the DNA fragment lengths. As a result, the two clones of them (XB, F) were believed to be 
satisfactory from the standpoint of both strength and fragment length, and thus cleaved further with some restriction 
enzymes to carry out Southern hybridization. As a result, a hybridizing fragment having a length of about 5 kbp was 
detected by cleaving with Sal I, so that it was subcloned to the Sal I site of pUC 19 (Takara Shuzo Co., Ltd.) (referred 

35 to as pB and pF, respectively). When each clone was subjected in more detail to mapping with restriction enzymes, pB 
and pF were judged as the identical DNA. Thus, a deletion plasmid on the latter was prepared with restriction enzyme 
according to the conventional method, and the base sequence of the DNA chain for about 2 kbp fragment comprising 
the hybridizing DNA fragment was determined with a fluorescent sequencer (SEQ ID NO. 1). An open reading frame 
(ORF) consisting of 1251 bp was found from it, and an amino acid sequence having 417 residues was presumed (SEQ 

40 ID NO. 2). Comparing the amino acid sequence with that of proteins registered to database for homology, it exhibited 
significant homology with a fatty acid synthase. In particular, ft exhibited the highest homology of 74% with the protein 
which was thought to be fabF or J In the total genome sequence for Svnechocvstis sp. strain PCC6803 prepared by 
KA2USA DNA Institute (database DDBJ accession No. D90905, PID; gl652389) (Fig. 3). As regards the other proteins, 
it exhibited homologies of 43 - 46% with the KASIt gene derived from the castor bean and the KASI gene derived from 

45 barley and 35% with KASI derived from E^ coli. These homologies of genes however could not distinguish which of 
KASI, II or III corresponds to the function of the ORF 

Example 3 Measurement of activity of KASII-like gene derived from Anacvstis nidulans in E. coli 

so [0042] In order to specify the function of the above described gene, the expression in E. coli was tried. Firstly, in order 
to remove the excess DNA sequences upstream and downstream ORF, prepared was a DNA in which at the N terminal, 
Nde I site was introduced in the 5' side of the start codon (ATG) by DNA synthesis (Applied Biosystem Co.), while Hind 
111 site was introduced immediately downstream the ORF 

55 1 ; 5'-CGCACATATGACTGAAACCGGACGCC (SEQ ID NO. 5) 

2: S'-CCGCAAGCTTGCAGCAGCGCGTACTGC (SEQ ID NO. 6) 

[0043] PCR reaction was performed with both synthetic DNAs as a primer in the presence of pF as a template DNA. 
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Regarding the reaction condition, the reaction was repeated 30 cycles according to the manual from Perkin-Elmer Co. 
with each cycle comprising the reaction at 94*0 (1 minute), 60°C (1 minute) and 72°C (2 minutes). The reaction product 
was cleaved with Ndel and Hind III, followed by cloning with pET17b preliminarily cleaved with the same restriction 
enzyme set as described above into E co[i strain DH5, which was then cloned into a strain BL21(DE3)pLysS (Nova- 
s gen). 

[0044] The recombinant of a latter E coii strain was cultured (32°C) until the turbidity of the culture solution reached 
0.5 OD at a wavelength of 600 nm in 75 ml of LB medium to which 1 00 ug/ml of ampicillin and 30 jug/ml of chloramphen- 
icol had been added. IPTG was then added at a final concentration of 0.4 mM, and culturing was further continued for 
2 hours. E. coii was recovered from the culture medium by centrifugation at 10,000 x g for 10 minutes, and the cells 

io were washed with 50 mM Tris HCI (pH 7.4) and frozen at -20°C. The cells were thawed in ice with a solution consisting 
of 20 mM Tris HQ (pH 8.0), 20 mM dithiothreithol, 10 mM MgCI 2 and 1 fig/ml Dnase I. The mixture was centrifuged at 
100,000 x g at 4°C for ^ hour, and the protein solution as the supernatant was subjected to SDS electrophoresis on a 
slab gel having a polyacrylamide concentration gradient from 1 0 to 20% followed by dying with Coomassie brilliant blue. 
As a result, the protein derived from Anacvstis nidulans was detected as a protein having a molecular weight of about 

75 50 kDa. 

[0045] As for the fatty acid composition of EL £Q_[i, the fatty acids were recovered for analysis as the methyl esters from 
the cells cultured as described above. Methylation was carried out by heating about 5 mg of lipid together with 1 ml of 
5% hydrochloric acid in anhydrous methanol in a sealed tube in boiling water for 4 hours, the reaction mixture was 
cooled by standing followed by extracting the fatty acid methyl esters with hexane. Methylated fatty acid esters were 
20 analyzed on a capillary column (polyester liquid phase; 10% EGSS-X, 175°C) with a hydrogen flame ionization detec- 
tor. Fatty acids were determined by comparing their relative retention times with those of standard methylated fatty 
acids. The results are listed in the following table. 
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Fatty acid compositions in E coii 


Sample 


14:0 


16:0 


16:1 


18:0 


18:1 


16:0+16:1/18:0+18:1 


Control 


2 


37 


21 


1 


38 


1.49 


Recombinant #1 


0 


24 


13 


6 


57 


0.59 


Recombinant #2 


0 


21 


11 


7 


62 


0.46 



35 [0046] As a result, it has been found that the fatty acids with 1 6 carbon atoms (1 6:0 and 16:1) were decreased, while 
the ratio of the fatty acids with 18 carbon atoms (18:0 and 18:1) was increased substantially 

Industrial Applicability 

40 [0047] The DNA chain coding for protein which has a p-ketoacyl-ACP synthetase II enzyme activity represented by 
the enzyme KASM derived from Anacvstis nidulans has been provided by the present invention. The gene coding for the 
enzyme protein of the present invention, as described above, is a gene of the enzyme KASII which has a remarkably 
high activity of converting a fatty acid (particularly, palmitic acid with C16) into an even longer fatty acid (particularly, 
stearic acid with C18), and is useful, by using transformation, for the improvement of the lipids of plants, the improve- 
rs ment of the lipids of microorganisms, particularly for the control of the ratio between fatty acids having 1 6 and 1 8 carbon 
atoms or for the increase of the content of unsaturated fatty acids. 
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SEQUENCE LISTING 

Applicant: Kirin Beer Kabushiki Kaisha 

Title of the Invention: £-Ketoacyl-ACP Synthetase II Enzymes 

AND Genes Coding for Same 

Docket No.: 113702-432 

Filing Date: January 20, 1998 

Number of Sequences: 6 

SEQ ID NO. : 1 

LENGTH: 1251 base pairs 
TYPE: nucletic acid (DNA) 
STRANDEDENESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: Genomic DNA 
ORIGINAL SOURCE: 

ORGANISM: An a cy St. IS nidulans 

STRAIN: JAM M-6 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATGACTGAAA CCGGACGCCA GCGTGTTGTT ATTACTGGTT TGGGAGCCAT TACTCCCATC 60 

GGTAATGATC CAACGGAATA TTGGCAGGGA ATCCTTGCCG GTCGCAACGG CATCGATCTG 120 

ATTCGGGGCT TTGATGCGTC TCGTCACGCC TGCAAAATTG CCGGGGAGGT CAAGGACTTT 180 

GACCCCACCC AGTACATGGA CCGCAAGGAT GCTAAGCGGA TGGATCGGTT TGCACAACTG 240 

GCGGTTGCTG CCAGTCGCCA AGCAGTCGCC GATGCCAAGC TGGACATCAC TGAACTGAAT 300 

GCGGATGCGA TCGGGGTGCT GATCGGCTCA GGCATTGGTG GTTTGAGGGT GATGGAGGAC 360 

CAGCAGACGG TTTTGCTGGA AAAAGGCCCC GATCGCTGCA GCCCCTTCAT GGTGCCGATG 420 

ATGATCGCCA ACATGGCGGC AGGACTGACG GCCATCCAGT TGGGTGCCAA AGGCCCTTGC 480 

AATGTCACGG TGACTGCTTG CGCTGCGGGT TCTAATGCGG TGGGTGAAGC CTTCCGGCTG 540 

ATTCAGCACG GCTATGCCCA AGCCATGATC TGTGGCGGAA CTGAATCCTG TGTGACCCCA 600 

CTGGCTATGG CCGGTTTTGC GGCCTGTAAG GCACTGTCGC TGCGCAACGA TGACCCCOCC €60 

CATGCTTGCC GTCCCTTTGA CCAAGGCCGT GATGGTTTTG TGATGGGCGA AGGCGCAGGG 720 

ATTTTGGTCT TGGAATCCTT GGAGCATGCC CAAGCGAGGG GCGCTCACAT CTATGGCGAA 780 

ATCGTCGGCT ATGGCATGAC CTGTGATGCC TATCA CAT CA CCTCGCCGGT CCCAGGTGCT 840 

TTGGGTGCGG CCCGGGCGAT CGAGTTCGGG CTCCGCGATG CCAATCTGCA GCCCAGCCAA 900 

GTCAGCTACA TCAATOCTCA CGGCACCAGC ACACCGGCCA ACGACAGCAC CGAAACGGCA 960 

GCTATTAAGA AAGCCCTAGG TCAGCACGCC TACAAAACCG TGATCAGCTC GACTAAGTCG 1 02 0 

ATGACCGGfrC ACCTGTTAGG GGGCTCCGGC GGAATTGAGG CGGTAGCGGC AACCCTCGCG 1080 

ATCGCTGAGG ACATGGTCCC GCCGACCATT AACCTGGAAG ATCCCGATCC CGATTGCGAC 1140 

TTGGACTATG TCCCCAATCA GGCGCGATCG CTACCGGTGG AAGTGGCTTT GTCCAATTCC 1200 
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TTCGG CTTTG GTGGGCACAA CGTCACGCTG GCCTTCCGGA AATTCCATCC C 1251 

SEQ ID No.: 2 
LENGTH: 417 amino acids 
TYPE : protein 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
ORIGINAL SOURCE: 

organism: Anacystis rudulans 

75 STRAIN; I AM M-6 

NAME /KEY: CDS 
IDENTIFICATION METHOD: P 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Glu Thr Gly Arg Gin Arg Val Val lie Thr Gly Leu Gly Ala 

15 10 15 

He Thr Pro He Gly Asn Asp Pro Thr Glu Tyr Trp Gin Gly He Leu 

20 25 30 

Ala Gly Arg Asn Gly He Asp Leu He Arg Gly Phe Asp Ala Ser Arg 

35 40 45 

His Ala Cys Lys He Ala Gly Glu Val Lys Asp Phe Asp Pro Thr Gin 
30 50 x 55 60 

Tyr Met Asp Arg Lys Asp Ala Lys Arg Met Asp Arg Phe Ala Gin Leu 
65 70 75 80 

Ala Val Ala Ala Ser Arg Gin Ala Val Ala Asp Ala Lys Leu Asp He 

85 90 95 

Thr Glu Leu Asn Ala Asp Ala He Gly Val Leu He Gly Ser Gly He 

100 105 110 

Gly Gly Leu Arg Val Met Glu Asp Gin Gin Thr Val Leu Leu Glu Lys 

115 120 125 

Gly Pro Asp Arg Cys Ser Pro phe Met Val Pro Met Met lie Ala Asn 

130 135 140 

Met Ala Ala Gly Leu Thr Ala He Gin Leu Gly Ala Lys Gly Pro Cys 
145 150 155 160 

Asn Val Thr Val Thr Ala Cys Ala Ala Gly Ser Asn Ala Val Gly Glu 

165 170 175 

Ala Phe Arg Leu He Gin His Gly Tyr Ala Gin Ala Met lie Cys Gly 
180 185 190 
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Gly Thr Glu Ser Cys Val Thr Pro Leu Ala Met Ala Gly Phe Ala Ala 

195 200 205 

Cys Lys Ala Leu Ser Leu Arg Asn Asp Asp Pro Ala Bis Ala Cys Arg 

210 215 220 

Pro Phe Asp Gin Gly Arg Asp Gly Phe Val Wet Gly Glu Gly Ala Gly 
225 230 235 240 

lie Leu Val Leu Glu Ser Leu Glu His Ala Gin Ala Arg Gly Ala His 

245 250 255 

lie Tyr Gly Glu* lie Val Gly Tyr Gly Met Thr Cys Asp Ala Tyr His 

260 265 270 

lie Thr Ser Pro Val Pro Gly Gly Leu Gly Ala Ala Arg Ala He Glu 

275 280 285 

Phe Gly Leu Arg Asp Ala Asn Leu Gin Pro Ser Gin Val Ser Tyr He 

290 295 300 

Asn Ala His Gly Thr Ser Thr Pro Ala Asm Asp Ser Thr Glu Thr Ala 
305 310 315 320 

Ala He Lys Lys Ala Leu Gly Glu His Ala Tyr Lys Thr Val lie Ser 

325 330 335 

Ser Thr Lys Ser Met Thr Gly His Leu Leu Gly Gly Ser Gly Gly He 

340 345 350 

Glu Ala Val Ala Ala Thr Leu Ala He Ala Glu Asp Met Val P*^> ?r© 

355 360 365 

Thr He Asn Leu Glu Asp Pro Asp Pro Asp Cys Asp Leu Asp Tyr Val 

370 375 380 

Pro Asn Gin Ala Arg Ser Leu pro Val Glu Val Ala Leu Ser Asn Ser 
385 390 395 400 

Phe Gly Phe Gly Gly His Asn Val Thr Leu Ala Phe Arg Lys Phe His 
405 410 415 

Pro 
417 



45 SEQ ID NO. : 3 

LENGTH: 20 base pairs 
TYPE i nucleic acid (DNA) 
STRANDEDNESS : double 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA) 
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SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



5'-CC(ACGT)CC(AG)AA(ACGT)CC(AG)AA(ACGT)GA(AG)TT-3 
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SEQ ID No,: 4 
LENGTH: 23 base pairs 
TYPE: nucleic acid (DNA) 
TOPOLOGY : linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

5 ' ~GA ( AG ) GA ( AG ) GT { ACGT ) AA ( CT ) TA t CT ) AT ( ACT ) AA ( CT ) GC— 3 ' 23 

SEQ ID No. : 5 
LENGTH : 26 base pairs 
TYPE: nucleic acid (DNA) 
TOPOLOGY ; linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

5 ' -CGCACATATGACTGAAACCGGACGCC-3 ' 2 6 

SEQ ID No.: 6 
LENGTH: 27 base pairs 
TYPE: nucleic acid (DNA) 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

5 ' -CCGCAAGCTTGCAGCAGCGCGTACTGC-3 ' 27 



Claims 

1. A protein which has an amino acid sequence represented by SEQ ID NO. 2 or substantially the same amino acid 
sequence as the one represented by SEQ ID NO. 2 and has the p-ketoacyl-ACP synthetase II enzyme activity. 

2. A protein according to Claim 1 , wherein substantially the same amino acid sequence is the one in which one or 
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more amino acids are substituted, deleted, inserted or added. 

A protein according to Claims 1 or 2, wherein the amino acid sequence is derived from cyanobacterium. 

A gene coding for a protein which has an amino acid sequence represented by SEQ ID NO. 2 or substantially the 
same amino acid sequence as the one represented by SEQ ID NO. 2 and has the p -ketoacyl-ACP synthetase II 
enzyme activity. 

A p-ketoacyi-ACP synthetase II enzyme active protein gene according to claim 4, wherein substantially the same 
amino acid sequence is the one in which one or more amino acids are substituted, deleted, inserted or added. 

A gene according to Claims 4 or 5, wherein the protein encoded is derived from cyanobacterium. 

A recombinant vector comprising the p-ketoacyl-ACP synthetase II enzyme active protein gene according to any 
one of Claims 4 -6. 

A cell in which the p-ketoacyl-ACP synthetase II enzyme active protein gene according to any one of Claims 4 -6 
has been introduced. 
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MTETGRQRW ITGLGAITPI GNDPTEYWQG 
DPTQYMDRKD AKRMDRFAOL AVAASRQAVA 
OQTVLLEKGP DRCSPFMVPM Ml ANMAAGLT 

I QHGYAQAM I CGGTESCVTP LAMAGFAACK 
ILVLESLEHA QARGAHIYGE IVGYGWTCDA 
VSYiNAHGTS TPANDSTETA AlKJCALGEHA 
IAEOMVPPTI NLEDPOPDCO LDYVPNQARS 



ILAGRNGIDL IRGFDASRHA CK I AGEVKOF 
DAKLDITELN ADAIGVLIGS GIGGLRVMED 
AIQLGAKGPC NVTVTACAAG SNAVGEAFRL 

ALSLRNDDPA HACRPFDOGR DGFVMGEGAG 
YHITSPVPGG LGAARAIEFG LRDANLQPSO 
YKTVISSTKS MTGHLLGGSG GIEAVAATLA 
LPVEVALSNS FGFGGHNVTL AFRKFHP 



FIG. I 
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10 20 30 40 50 
BARLEY UHAHAAHALGLRVPPPAFPRRRARPRRR — PAAAVLATSAAPORE TOP — RKRW 

CASTOR - OIL PLANT PCSKYYSSNGLFP^LU'KRHP^^ 

80 90 100 110 120 130 

60 70 80 90 100 110 

BARLEY I TGUGLASVFGSOVDTFYDRLLA6ESGVGP I DRFOASSFPTRFAG0 1 RGFSSEGY I DGKM 

CASTOR- OIL PLANT VTGUGWSPLGHD I DVYYNNLLD6SSG I SO I DSFDCAQFPTR I AGE I KSFSTDGWVAPKL 

140 150 160 170 1B0 190 

120 130 140 150 160 170 

BARLEY DRRLDDC I RYC I LSOOCALESAGLGAGSDAHVKLDVGRAGVLVGTGMGGLSVFSOGVOhlL 

CASTOR - OIL PLANT SKRUOJOTJLYULTAGKKAL^DGG I — TEDMMDELDKARCGVL 1 GSAUGGUKVFNDA I EAL 

200 210 220 230 240 

180 190 200 210 220 230 

BARLEY I EKGYRK ! SPFF I PYA ! TNWGSALU I OVGFMGPNYS I STACATSNYCFYAAANH I RRGE 

CASTOR -OIL PLANT -R I SYRKJMTXT/PFATTNMGS^ I STACATSNFC ! LNAANH I IRGE 

250 260 270 280 290 300 

240 250 260 270 v 280 290 

BARLEY AD I I VAGGTEAA 1 1 P I GLGGFVACRALSGRNDDP I TACRPWDKERDGFVUGEGAGVLVUE 

CASTOR -OIL PLANT AD 1 ULCGGSDAA I I P I GLGGFVACRALSORhODPTXASRPWL^ GAGVULLE 

310 320 330 340 350 360 

300 310 320 330 340 350 

BARLEY SLEHAMKRDAP 1 1 AEYLGGAN^AYHliTDPRADGLGVSSC I TUSLRDAGVAPEEVNY I N 

CASTOR-OIL PLANT ELEHAKKRGAN I YAEFLGGSFTCDAYHWTEPRPDGVGV I LC J EKALARSGVSKEEVNY I N 

370 380 390 400 410 420 

360 370 3B0 390 400 410 

BARLEY AHATSTLAGDLAEVRA I KOVFKNPSE I K I NSTKSU 1 GHCLGAAGGLEA 1 AT I ICS i TTGWV 

CASTOR -OIL PLANT AI^TSTPAGOLKFYEAUilRCFSONPDLflVNSTKSiyi I GHLL6AAGAVEA I AT I OA I RTGW 

430 440 450 460 470 480 

420 430 440 450 460 

BARLEY HPT I NQFNPEPEVDFDTVAN9CKQH-EWVG I SNSFGFGGHNSVWFAPFK 

CASTOR- OIL PLANT HPN I NLEfPEEGVDTKVLVGPKKERLDI KVALSNSFGFGGHNSS 1 1 FAPYK 

490 500 510 520 530 

F I G. 2 
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10 20 30 40 50 60 

Fab MAhLEKKRVVYTGtGA I TP I GWTLOOYWQGLUEGRNG I GP I THFOASDQACRFGGEVKOF 

•*■ • ••)(■••*••»•••••>•• »••••••■*•»•>» « « • • • ••••**•**•*• 

KAS I I MTETGRCKVV I TGLGA I TP I GNDPTEYWQG ( LAGRNG I DL I RGFDASRHACK I AGEVKDF 

10 20 30 40 50 » 

70 BO 90 100 110 120 

Fab DATQFLDRKEAKP«Df^CHFAVCASQQA I NDAKIV ! NELNADE IGVUGTGI GGLKVLED 



KAS 1 1 0PTQYWDRKDAKRMDRFAQUVMSRQAVADAKU3 1 TELNADA IGVLIGSGI GGLRVWEO 

70 80 90 100 110 120 

130 140 1 50 160 170 160 

Fab OQTI LLDKGPSRCSPFU 1 PMM ! ANUASGLTA I NLGAKGPNNCTVTACAAGSNA I GOAFRL 



KAS ( ! OQTVU^CPORCSPFMVPWU I ANWAAGLTA I QLGAKGPCNVTVTACAAGSMAVGEAFRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

Fab VQNGYAKAU ! CGGTEAA I TPLSYAGFASARALSFFMEPUiASFP^ 



KAS 1 1 t OHGYAQAU I CGGTESCVTFi-AMAGFMCKAlJSLRNOOPAHACRPFO^ 

190 200 210 220 230 240 

250 260 270 280 290 300 

Fab 1 L 1 LEELESAIARGAK t YGEtfVGY AMTCDAYH I TAPVPDGRGATRA i AWALKDSGLKPEM 



KASI I I LVLESLEHAQARGAH I YGE ! VGYGUTCDAYH I TSPVPGGLGAARA I EFGLRDANIOPSQ 

250 260 270 280 290 300 

310 320 330 340 350 360 

Fab YSY I NAHGTSTP ANOVTETRA 1 KQALGNHAYN I AVSSTKSUTGHLLGGSGG ! EAVATVMA 



KAS 1 1 VSY I NAHGTSTPANDSTETAA I KKALGEHAYKTV I SSTKStfTGHLLGGSGG I EAVAATLA 

310 320 330 340 350 360 

370 380 390 * 400 410 

Fab 1 AEDKVPPT I NLENPOPECDLDYVPGQSRAL ! VDVALSNSFGFGGHNVTLAFWCYO 

::;::x. :.. 

KAS 1 1 |AEDtfvmiM-ED^ 

370 380 390 400 410 
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